首页> 外文OA文献 >GPU-accelerated algorithms for many-particle continuous-time quantum walks
【2h】

GPU-accelerated algorithms for many-particle continuous-time quantum walks

机译:用于多粒子连续时间量子行走的GPU加速算法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Many-particle continuous-time quantum walks (CTQWs) represent a resource for several tasks in quantum technology, including quantum search algorithms and universal quantum computation. In order to design and implement CTQWs in a realistic scenario, one needs effective simulation tools for Hamiltonians that take into account static noise and fluctuations in the lattice, i.e. Hamiltonians containing stochastic terms. To this aim, we suggest a parallel algorithm based on the Taylor series expansion of the evolution operator, and compare its performances with those of algorithms based on the exact diagonalization of the Hamiltonian or a 4th order Runge–Kutta integration. We prove that both Taylor-series expansion and Runge–Kutta algorithms are reliable and have a low computational cost, the Taylor-series expansion showing the additional advantage of a memory allocation not depending on the precision of calculation. Both algorithms are also highly parallelizable within the SIMT paradigm, and are thus suitable for GPGPU computing. In turn, we have benchmarked 4 NVIDIA GPUs and 3 quad-core Intel CPUs for a 2-particle system over lattices of increasing dimension, showing that the speedup provided by GPU computing, with respect to the OPENMP parallelization, lies in the range between 8x and (more than) 20x, depending on the frequency of post-processing. GPU-accelerated codes thus allow one to overcome concerns about the execution time, and make it possible simulations with many interacting particles on large lattices, with the only limit of the memory available on the device. Program summary Program Title: cuQuWa Licensing provisions: GNU General Public License, version 3 Program Files doi: http://dx.doi.org/10.17632/vjpnjgycdj.1 Programming language: CUDA C Nature of problem: Evolution of many-particle continuous-time quantum-walks on a multidimensional grid in a noisy environment. The submitted code is specialized for the simulation of 2-particle quantum-walks with periodic boundary conditions. Solution method: Taylor-series expansion of the evolution operator. The density-matrix is calculated by averaging multiple independent realizations of the system. External routines: cuBLAS, cuRAND Unusual features: Simulations are run exclusively on the graphic processing unit within the CUDA environment. An undocumented misbehavior in the random-number generation routine (cuRAND package) can corrupt the simulation of large systems, though no problems are reported for small and medium-size systems. Compiling the code with the -arch=sm_30 flag for compute capability 3.5 and above fixes this issue.
机译:多粒子连续时间量子行走(CTQW)代表了量子技术中多项任务的资源,包括量子搜索算法和通用量子计算。为了在现实的情况下设计和实现CTQW,需要一种有效的汉密尔顿模拟工具,其中要考虑静态噪声和晶格波动,即包含随机项的汉密尔顿方程。为此,我们建议一种基于演化算子的​​泰勒级数展开的并行算法,并将其性能与基于哈密顿量或四阶Runge-Kutta积分的精确对角化的算法的性能进行比较。我们证明了泰勒级数展开法和Runge-Kutta算法都是可靠的,并且具有较低的计算成本,泰勒级数展开式显示了内存分配的额外优势,而与计算精度无关。两种算法在SIMT范例中也具有高度可并行性,因此适用于GPGPU计算。反过来,我们针对2粒子系统在越来越大的网格上对4个NVIDIA GPU和3个四核Intel CPU进行了基准测试,表明相对于OPENMP并行化,GPU计算提供的加速范围在8倍之间和(大于)20倍,具体取决于后期处理的频率。因此,GPU加速代码使人们可以消除对执行时间的担忧,并可以在大型格子上使用许多相互作用的粒子进行仿真,而只有设备上可用的内存有限。程序摘要程序标题:cuQuWa许可条款:GNU通用公共许可证,版本3程序文件doi:http://dx.doi.org/10.17632/vjpnjgycdj.1编程语言:CUDA C问题性质:多粒子连续演化嘈杂环境中多维网格上的时间量子游走。提交的代码专门用于模拟具有周期性边界条件的2粒子量子行进。解决方法:演化算子的​​泰勒级数展开。密度矩阵是通过对系统的多个独立实现进行平均计算得出的。外部例程:cuBLAS,cuRAND异常功能:模拟仅在CUDA环境中的图形处理单元上运行。尽管没有报告中小型系统的问题,但随机数生成例程(cuRAND程序包)中未记录的不当行为可能会破坏大型系统的仿真。使用-arch = sm_30标志编译代码以实现3.5和更高版本的计算能力可解决此问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号